Abstract

The problem of broadcasting a parallel Gaussian source over an additive white Gaussian noise broadcast
channel under the mean-squared error distortion criterion is studied. A hybrid digital-analog coding strategy which
combines source coding with side information, channel coding with side information, layered source coding, and
superposition broadcast channel coding is presented. When specialized to the open problem of broadcasting a white
Gaussian source over an additive white Gaussian noise broadcast channel with bandwidth mismatch, which has been
the subject of several previous investigations, this coding scheme strictly improves on the state-of-the-art.

Keywords: Source-channel coding, source broadcasting, parallel Gaussian source, broadcast channel, 
hybrid-digital-analog, Wyner-Ziv, Gel'fand-Pinsker, dirty-paper coding, MMSE estimation, bandwidth 
mismatch. 

I. Introduction

In this paper, we study the problem of broadcasting the same source to a set of receivers over a common
channel. The objective is to devise an encoding strategy (which utilizes a common power and channel
bandwidth resource) to simultaneously deliver different qualities of service depending on the quality of
the channel experienced by the receivers.

For the problem of transmitting a memoryless Gaussian source over a memoryless additive Gaussian
noise point-to-point channel (i.e., when there is only one receiver) operating at the same symbol rate (in
other words, when the memoryless source and channel have the same bandwidth), Goblick recognized
that the uncoded transmission strategy of transmitting the source samples scaled so as to meet the average
encoder power constraint, followed by the optimal linear minimum mean-squared error estimation of
the source samples from the channel observations at the receiver, results in the optimal delivered quality
(measured in mean-squared error (MSE) distortion) [1]. Since the transmitter remains the same irrespective
of the channel noise variance, the same strategy is optimal even when there are multiple receivers (i.e.,
it is optimal for broadcasting over a memoryless Gaussian broadcast channel [2]). However, the problem
remains open when the source and channel bandwidths are mismatched [3], [4], [5], or more generally,
when the source has memory [6].
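Goblick's observation can be checked numerically. The following is a minimal sketch, assuming a scalar Gaussian source of variance `variance`, transmit power `power`, and noise variance `noise` (all names are illustrative and not taken from the paper). It compares the MSE of scaled uncoded transmission followed by linear MMSE estimation with the distortion obtained by equating the Gaussian rate-distortion function to the channel capacity.

```python
import math


def uncoded_distortion(variance, power, noise):
    """MSE of the linear MMSE estimate of S from Y = sqrt(power/variance)*S + Z."""
    return variance / (1.0 + power / noise)


def separation_distortion(variance, power, noise):
    """Distortion D solving R(D) = C for one channel use per source sample:
    0.5*log2(variance/D) = 0.5*log2(1 + power/noise)."""
    capacity_bits = 0.5 * math.log2(1.0 + power / noise)
    return variance * 2.0 ** (-2.0 * capacity_bits)


if __name__ == "__main__":
    # The transmitter is identical for every noise level, yet each receiver
    # attains its own point-to-point optimum.
    for noise in (0.5, 1.0, 4.0):
        print(noise,
              uncoded_distortion(1.0, 2.0, noise),
              separation_distortion(1.0, 2.0, noise))
```

The two functions agree for every noise variance, which is why a single uncoded transmitter serves all receivers of a bandwidth-matched Gaussian broadcast channel simultaneously.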

One obvious digital approach to the bandwidth mismatched problem is to use the classical separation 
method of scalable source coding [7] followed by a degraded-message-set broadcast channel coding [8]. 
In this approach, a coarse source layer is communicated as a common message intended for all users and 
a refinement layer is communicated only to some of the users. For the two-user problem, employing this 
simple scheme, we make the following observation (which is proved in Appendix I).

rpuri@eecs.berkeley.edu. Kannan Ramchandran is with the Wireless Foundations, Dept. of EECS, University of California, Berkeley, CA
94702, USA, e-mail: kannanr@eecs.berkeley.edu.

Proposition 1: For a memoryless Gaussian source, and a memoryless Gaussian broadcast channel, the
gap in mean-squared error distortion (measured in dB) between that achieved by the optimal point-to-point
source-channel coder and that achieved by the separation approach above can be upper bounded by a constant
which depends only on the bandwidth mismatch. In particular,



where $D_1$ and $D_2$ are the MSE distortions incurred by the receivers 1 and 2 respectively, $D_k^{\mathrm{opt}}$ is the
optimal point-to-point distortion to receiver k, k = 1, 2, and $\mathrm{BW}_{\mathrm{source}}$ and $\mathrm{BW}_{\mathrm{channel}}$ are the source and
channel bandwidths respectively.

The above proposition upperbounds the gap between the trivial lowerbound on the distortion (namely, 
the optimal point-to-point distortion) and the distortion achieved by the separation scheme. The gap to 
optimality of the separation scheme for any number of users was studied recently in [9]. Better achievable 
strategies have been proposed by Mittal and Phamdo in [4] and for the bandwidth expansion case (where 
the bandwidth of the channel is larger than that of the source) in [3] and more recently by Reznic, 
Feder and Zamir in [5]. Also, an improved upperbound for the bandwidth expansion case is available 
in [5]. In this paper, we consider a slightly more general problem which allows for multiple independent 
source components and propose an improved achievable strategy. When our solution is specialized to the 
memoryless source and memoryless channel setting with bandwidth mismatch, it improves the methods 
proposed in [3], [4]. We would also like to point out that for the special case of bandwidth expansion, our 
scheme essentially matches the proposal of Reznic, Feder, and Zamir [5]. We obtain a slight improvement 
over [5] through a generalization overlooked there. 

For the case of a memoryless Gaussian source communicated over an additive memoryless Gaussian 
noise broadcast channel operating at the same symbol rate, the observation of Goblick mentioned earlier 
gives a rather simple optimal scheme - transmit the source samples scaled so as to meet the average 
encoder power constraint, and the receivers perform the optimal linear minimum mean-squared error 
estimation of the source samples from their respective channel observations. This illustrates an interesting 
feature of analog methods - their ability to enable simultaneous enjoyment of the power and bandwidth 
resource by each of the broadcast receivers. 

On the other hand, the obvious digital approach to the above problem is to use the classical separation 
method of scalable source coding [7] followed by degraded message-set broadcast channel coding [8]. In 
this approach, the coarse source layer is communicated as a common message intended for all users and 
the refinement layer is communicated only to some of the users. Consider a white Gaussian broadcast 
channel with two receivers; let us call the one with the lower noise variance strong and the other weak. 
While the common portion of the information, being limited by the weak receiver, is sub-optimal for the 
strong receiver, the refinement portion is completely unusable by the weak receiver and in fact acts as 
interference to it. Thus unlike the analog methods, the digital approach necessarily involves a "splitting" 
of the total system resource. 

While the above discussion illustrates the power of analog methods, real-world sources are characterized 
by a high degree of memory, and thus they are far from the memoryless model. Parallel source models 
describe these sources more effectively than a memoryless model. For this case, in a point-to-point set-up, 
analog transmission is sub-optimal in general. For a parallel Gaussian source with m source components 
and an additive memoryless Gaussian noise channel model with an equal number of sub-channels, the loss 
in performance of the analog approach with respect to the digital approach for sufficiently large transmit 
powers can be shown to be

(1)

where $\sigma_j^2$ denotes the variance of the j-th source component [10], [11]. Thus, this gap grows with the
memory of the source and can be arbitrarily large.
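To get a feel for how the analog-versus-digital gap behaves, the sketch below evaluates it numerically under stated assumptions that are not taken from the text: the analog scheme sends each component scaled and uncoded, with sub-channel powers proportional to the component standard deviations (the allocation that minimizes the sum distortion at high power), and the digital scheme operates in the regime where reverse water-filling gives every component the same distortion. The function names and the closed form in `high_power_gap_db` are part of this illustration and should not be read as the paper's expression (1).

```python
import math


def analog_distortion(variances, power, noise):
    """Average MSE of scaled uncoded transmission with P_j proportional to sigma_j
    (assumed allocation), subject to an average power of `power` per sub-channel."""
    m = len(variances)
    sigmas = [math.sqrt(v) for v in variances]
    powers = [m * power * s / sum(sigmas) for s in sigmas]
    return sum(v / (1.0 + p / noise) for v, p in zip(variances, powers)) / m


def digital_distortion(variances, power, noise):
    """Separation-based distortion when all components are active:
    sum_k 0.5*log(var_k/D) = m * 0.5*log(1 + power/noise)."""
    m = len(variances)
    geo_mean = math.exp(sum(math.log(v) for v in variances) / m)
    d = geo_mean / (1.0 + power / noise)
    if d > min(variances):
        raise ValueError("power too low for the all-components-active regime")
    return d


def gap_db(variances, power, noise):
    """Loss of the analog scheme relative to the digital one, in dB."""
    return 10.0 * math.log10(analog_distortion(variances, power, noise)
                             / digital_distortion(variances, power, noise))


def high_power_gap_db(variances):
    """Large-power limit of gap_db under the assumptions above:
    (mean of sigma_j)^2 divided by the geometric mean of sigma_j^2."""
    m = len(variances)
    arith = sum(math.sqrt(v) for v in variances) / m
    geo = math.exp(sum(math.log(v) for v in variances) / m)
    return 10.0 * math.log10(arith ** 2 / geo)
```

For equal variances the gap vanishes, consistent with the optimality of uncoded transmission for a white source; as the spread of the variances grows, so does the gap.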
This motivates the main question posed in this paper: what is an efficient way to broadcast parallel
Gaussian sources over memoryless two-user Gaussian broadcast channels? Our solution is driven by aiming
to extract the best of both the analog and the digital worlds. We do this by invoking a hybrid uncoded-coded
strategy, where the coded system uses a combination of the tools of successive refinement source
coding [7], source coding with side-information or Wyner-Ziv (WZ) coding [12], superposition broadcast
channel coding [2], and channel coding with side-information or Gel'fand-Pinsker (GP) coding [13] or
dirty-paper coding [14]. We would like to point out that this remains an open problem in general and we
present an achievable strategy which constitutes the state-of-the-art to the best of our knowledge.

In the next section we present the problem setup. We then proceed by first considering two special
cases in Section III: (i) when the weak user obtains its point-to-point optimal performance, and (ii) when
the strong user obtains its point-to-point optimal performance. In the first case, we design a hybrid analog-
digital scheme by modifying the ideas of successive refinement source coding and superposition channel
coding. The idea which was presented in a conference version of this paper [6] was also independently
explored in [15] where a point-to-point setting with a memoryless source and channel without any
bandwidth mismatch was considered. In the second case, we present a hybrid scheme based on the ideas of
source coding with side-information and channel coding with side-information. Section IV then considers
a scheme which combines all these ideas to obtain a trade-off between the qualities of reproductions
at the receivers. We take up the special case of memoryless Gaussian source and memoryless Gaussian
broadcast channel with bandwidth mismatch in Section V. We conclude with some comments on potential
directions of research.



Fig. 1. Problem setup.



II. Problem setup 

The setup is shown in Fig. 1. In this paper, we only consider the case where all the sub-channels to a
particular user have the same statistics. The upshot of this assumption is a simplification which results
from recognizing that there is an ordering of the users according to the noise variance of their channel.
We will call the user with the smaller noise variance the strong user, and the user with the larger noise
variance the weak user.¹ Note that the ideas presented in this paper also apply to the case of parallel
broadcast channels where the different sub-channels to the same user may not have the same statistics,
but we do not explore them here.

We let our source have K independent components. The k-th component is denoted by $S_k(i)$ and it is
independent and identically distributed (i.i.d.) over the time index $i = 1, 2, \ldots$. Our source is Gaussian,
$S_k(i) \sim \mathcal{N}(0, \sigma_k^2)$. Without loss of generality, we will assume that $\sigma_1^2 \ge \sigma_2^2 \ge \ldots \ge \sigma_K^2$. We will explicitly
model the fact that the source and channel bandwidths do not necessarily match. We will assume that
there are M parallel broadcast sub-channels of the same statistics. When M = K, the bandwidth of the
source matches the bandwidth of the channel. The weak user observes $Y_{w,m}(i) = X_m(i) + Z_{w,m}(i)$,
$m = 1, 2, \ldots, M$, $i = 1, 2, \ldots$, and the strong user observes $Y_{s,m}(i) = X_m(i) + Z_{s,m}(i)$, where $X_m(i)$ is the
input to the m-th sub-channel. The noise processes $Z_{w,m}(i)$ and $Z_{s,m}(i)$ are independent i.i.d. (over m
and i) Gaussians with variances $N_w$ and $N_s$ respectively, where $N_s < N_w$. The source-channel encoder
$f^n : \mathbb{R}^{Kn} \to \mathbb{R}^{Mn}$ maps an n-length block of the source to an n-length block of the channel input. There
is an average power constraint on the encoder so that $\sum_{i=1}^{n} \sum_{m=1}^{M} E[X_m^2(i)]/(nM) \le P$. The source-channel
decoders $g_s^n : \mathbb{R}^{Mn} \to \mathbb{R}^{Kn}$ and $g_w^n : \mathbb{R}^{Mn} \to \mathbb{R}^{Kn}$ at the strong and weak user, respectively,
reconstruct n-length blocks,

$$\{\hat{S}_{s,k}(i),\ i = 1, \ldots, n,\ k = 1, \ldots, K\} = g_s^n\left(\{Y_{s,m}(i),\ i = 1, 2, \ldots, n,\ m = 1, 2, \ldots, M\}\right), \text{ and}$$

$$\{\hat{S}_{w,k}(i),\ i = 1, \ldots, n,\ k = 1, \ldots, K\} = g_w^n\left(\{Y_{w,m}(i),\ i = 1, 2, \ldots, n,\ m = 1, 2, \ldots, M\}\right)$$

of the source from n-length blocks of the channel outputs. Distortions are measured as the average of the
mean-squared error distortion over all source components

$$D_j^n = \frac{1}{nK} \sum_{i=1}^{n} \sum_{k=1}^{K} E\left[\left(S_k(i) - \hat{S}_{j,k}(i)\right)^2\right], \quad j \in \{s, w\}.$$

A pair of distortions $(D_s, D_w)$ will be said to be achievable if for any $\epsilon > 0$, for sufficiently large n, there
is $(f^n, g_s^n, g_w^n)$ such that $D_j^n \le D_j + \epsilon$, $j \in \{s, w\}$. The problem is to characterize the region of all the
achievable distortions $(D_s, D_w)$ for a given transmit power P. This remains open. In the next sections,
we present our inner bound to the region (i.e., an achievable region).

III. An achievable solution: the extreme points

In this section we present all the key ideas involved in our achievable strategy. We consider two extreme 
cases - when the weak user achieves its point-to-point optimal quality, and when the strong user achieves 
its point-to-point optimal quality - to illustrate two complementary strategies which together constitute 
the general achievable solution. 

A. Weak-user-optimal case 

From Shannon's separation theorem [16, pg. 216] we know that the optimal solution for the point-to-point
source-channel problem can be obtained by the separation principle of first optimally source coding
and then transmitting the resulting bit stream using an optimal channel code. Thus the lowest distortion $D_w^*$
attainable by the weak user is given by "reverse water-filling" over the source components [16, pg. 348]

$$D_w^* = \frac{1}{K} \sum_{k=1}^{K} D_k, \qquad D_k = \begin{cases} \mu, & \text{if } \mu < \sigma_k^2, \\ \sigma_k^2, & \text{if } \mu \ge \sigma_k^2, \end{cases} \qquad (2)$$

where $\mu$ is chosen such that the total rate $(1/2) \sum_{k=1}^{K} \log(\sigma_k^2/D_k)$ equals the capacity $C_w$ of the weak user's
channel. $C_w$ is in turn given by $(1/2) \sum_{m=1}^{M} \log(1 + P_m/N_w)$ where we choose $P_1 = P_2 = \ldots = P_M = P$.

¹ It is known that the performance of such a broadcast channel is identical to that of a degraded broadcast channel where the weak user
receives the signal received by the strong user but further corrupted by an additive Gaussian noise independent of the additive noise corrupting
the strong user's channel and which has a variance equal to the difference between the variances of the additive noises of the weak user and
the strong user in the original channel. This fact is often expressed by saying that the original broadcast channel is stochastically degraded.
A proof of the above stated equivalence for the case of channel coding appears in [16, pg. 422]. The same idea can be used to show an
equivalence for the problem of interest here though we do not need to make use of this equivalence in our discussion here.
If this separation strategy is followed for the broadcast case as well, the strong user also recovers
the source at a distortion $D_w^*$. However, without compromising the quality of reproduction for the weak
user, better quality can be delivered to the strong user. Before presenting our solution in full generality,
it is useful to consider the special case of K = M = 2 and $\sigma_1^2 > \sigma_2^2$; see Fig. 3. Let us suppose that
the optimal point-to-point reverse water-filling solution for the weak user allocates distortions $D_1$ and
$D_2$ for the source components $S_1$ and $S_2$, respectively. Also, let us denote the powers allocated to the
sub-channels $X_1$ and $X_2$ by $P_1$ and $P_2$ respectively, with $P_1 = P_2 = P$. Then
$$\frac{1}{2} \log \frac{\sigma_1^2 \sigma_2^2}{D_1 D_2} = \frac{1}{2} \log \left( \frac{P_1 + N_w}{N_w} \cdot \frac{P_2 + N_w}{N_w} \right).$$

We can source code $S_1$ using a successive refinement strategy thereby producing two bit streams: a coarse
description at distortion $D_1'$ and a refinement stream which refines from $D_1'$ to $D_1$. Since Gaussian sources
are successively refinable [7], this can be done without loss of optimality for the weak user. We choose $D_1'$
such that the bitrate of the refinement stream is equal to the rate at which the first sub-channel operates,
i.e.,

$$\frac{1}{2} \log \frac{D_1'}{D_1} = \frac{1}{2} \log \left( 1 + \frac{P_1}{N_w} \right).$$

Combining the two equations above gives

$$\frac{1}{2} \log \frac{\sigma_1^2 \sigma_2^2}{D_1' D_2} = \frac{1}{2} \log \left( 1 + \frac{P_2}{N_w} \right).$$

In other words, without loss of optimality, we may send the coarse description for $S_1$ and the bit
stream for $S_2$ over the second sub-channel, and the refinement bitstream for $S_1$ over the first sub-channel;
see Fig. 2(a). This further suggests that instead of sending the refinement bitstream over the
first sub-channel, we may send uncoded the quantization error resulting from the coarse quantization
of $S_1$ appropriately scaled to satisfy the power constraint. The input to the first sub-channel will be
$\sqrt{P_1/D_1'}\,\left(S_1(i) - S_1'(i)\right)$, where $S_1'(i)$ is the i-th sample of the coarsely quantized version of $S_1$. It is
easy to see that this satisfies the power constraint on the first sub-channel and also results in no loss of
optimality for the weak user. The second fact is analogous to the optimality of uncoded transmission for
the point-to-point Gaussian source-channel problem. The strong user can achieve a lower distortion on $S_1$
because of the uncoded transmission of the quantization error. The strong user estimates the refinement
component as $(P_1/(P_1 + N_s))\sqrt{D_1'/P_1}\,Y_1$ and adds it to the coarse description to form its reproduction
of $S_1$. We note that the resulting distortion for the strong user on $S_1$ is $D_1'/(1 + P_1/N_s)$.
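The first step of the K = M = 2 construction can be traced numerically. The sketch below is illustrative only: `reverse_water_fill` finds the water level by bisection (an implementation choice of this example, not something specified in the text), rates are measured in bits, and the numerical values in the usage note are hypothetical.

```python
import math


def reverse_water_fill(variances, total_rate_bits):
    """Return (mu, [D_k]) with D_k = min(mu, var_k) and
    sum_k 0.5*log2(var_k/D_k) = total_rate_bits, found by bisection on mu."""
    lo, hi = 0.0, max(variances)
    for _ in range(200):
        mu = 0.5 * (lo + hi)
        rate = sum(0.5 * math.log2(v / min(mu, v)) for v in variances)
        if rate > total_rate_bits:
            lo = mu  # rate too high: raise the water level
        else:
            hi = mu
    mu = 0.5 * (lo + hi)
    return mu, [min(mu, v) for v in variances]


def coarse_distortion(d1, p1, n_w):
    """D_1' such that the refinement rate 0.5*log(D_1'/D_1) equals 0.5*log(1 + P_1/N_w)."""
    return d1 * (1.0 + p1 / n_w)


def strong_distortion_on_s1(d1_coarse, p1, n_s):
    """MMSE of the quantization error (variance D_1') sent uncoded with power P_1."""
    return d1_coarse / (1.0 + p1 / n_s)
```

As a usage example with hypothetical values $\sigma_1^2 = 4$, $\sigma_2^2 = 1$, $P = 3$, $N_w = 1$, $N_s = 0.5$: the weak user's capacity is 2 bits, water-filling gives $D_1 = D_2 = 0.5$, the coarse layer sits at $D_1' = 2$, and the strong user's distortion on $S_1$ drops to $2/7$ while the weak user still obtains $D_1 = 0.5$.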

The performance of the strong user can be further improved. Without losing optimality for the weak
user, we may send the coarse description of $S_1$ and the bit stream for $S_2$ using superposition coding
over the second sub-channel. In particular, we send the coarse quantization bit stream of $S_1$ using power
$P_2 - P_2'$ (defined below) and the bit stream for $S_2$ using power $P_2'$ such that the decoder can first decode
the former bit stream assuming the latter as interference. The decoder then cancels the interference from
the bit stream for $S_1$ and decodes the bit stream for $S_2$. Thus $P_2'$ is given by

$$\frac{1}{2} \log \frac{\sigma_1^2}{D_1'} = \frac{1}{2} \log \left( 1 + \frac{P_2 - P_2'}{P_2' + N_w} \right).$$
Fig. 2. Weak-user-optimal case: (a) separation scheme showing successive refinement (SR) and superposition coding, and (b) the hybrid
digital-analog scheme.

Fig. 3. Weak-user-optimal case: The schematic diagram shows the allocation for a K = M = 2 example. The optimal coded separation
scheme for the weak user may be thought of as sending source-coded bits about the second source component $S_2$ and the coarse-layer bits
from a successive refinement source coding of the first source component $S_1$ over the second sub-channel using superposition coding, and
the refinement-layer bits from the successive refinement coding of $S_1$ alone over the first sub-channel. An equivalent distortion performance
can be achieved at the weak user while improving the strong user's performance by (i) sending the quantization error from the coarse
quantization of $S_1$ uncoded (scaled) over the first sub-channel, and (ii) sending $S_2$ uncoded (scaled) over the second sub-channel and the
coarse-layer bits on $S_1$ channel coded with sufficient power to be decoded (by the weak user) treating the uncoded transmission of $S_2$ as
noise. The strong user benefits from both the uncoded transmissions since it can form better quality estimates than the weak user.
This also gives the relation

$$\frac{1}{2} \log \frac{\sigma_2^2}{D_2} = \frac{1}{2} \log \left( 1 + \frac{P_2'}{N_w} \right),$$

which indicates why decoding of the bit stream for $S_2$ after interference cancellation succeeds. This
scheme suggests that we may send $S_2$ uncoded using power $P_2'$ instead of sending its quantized bits.
Since, after canceling the interference from the $S_1$ bit stream, the channel to the weak user is an additive
white Gaussian noise channel, the optimality of this scheme follows from the optimality of uncoded
transmission for point-to-point Gaussian source-channel coding. The strong user can now reconstruct $S_2$
at a lower distortion, $D_2 (1 + P_2'/N_w)/(1 + P_2'/N_s)$. The scheme is summarized in Fig. 2(b). The overall
distortion achieved by the strong user is

$$D_s = \frac{1}{2} \left[ \frac{D_1'}{1 + P_1/N_s} + \frac{D_2 \left(1 + P_2'/N_w\right)}{1 + P_2'/N_s} \right].$$

The extension to K = M > 2 is straightforward. Let us assume without loss of generality that under
the point-to-point optimal reverse water-filling solution for the weak user, the first L source components
satisfy $(1/2) \log(\sigma_k^2/D_k) > (1/2) \log(1 + P_k/N_w)$. For these L components we define $D_k'$ such that
$(1/2) \log(D_k'/D_k) = (1/2) \log(1 + P_k/N_w)$. The k-th such component ($k \le L$) is source coded to a
distortion of $D_k'$ and the resulting error is sent uncoded (scaled by $\sqrt{P_k/D_k'}$) over the k-th sub-channel.
For sub-channels m > L, we define $P_m'$ as $(1/2) \log(\sigma_m^2/D_m) = (1/2) \log(1 + P_m'/N_w)$. The m-th
such component is sent uncoded over the m-th sub-channel scaled by $\sqrt{P_m'/\sigma_m^2}$. The rest of the power
$(P_m - P_m')$ for these sub-channels m > L is used to send the source coded bits from the first L
components. On these sub-channels, the decoders first decode these bits, cancel the interference caused
by them, and then estimate the source components. The first L source components are estimated directly
from the corresponding sub-channel outputs. Thus, without compromising the quality of reproduction for
the weak user, the strong user achieves a lower distortion

$$D_s = \frac{1}{K} \left[ \sum_{k=1}^{L} \frac{D_k'}{1 + P_k/N_s} + \sum_{m=L+1}^{K} \frac{\sigma_m^2}{1 + P_m'/N_s} \right].$$
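The K = M construction just described can be written out as a short computation. This is a minimal sketch under the assumptions stated in the text (equal power `power` on every sub-channel, K = M) and takes the weak user's reverse water-filling allocation `weak_distortions` as an input; the function and parameter names are illustrative.

```python
def strong_user_distortion(variances, weak_distortions, power, n_w, n_s):
    """Average distortion at the strong user under the weak-user-optimal hybrid scheme.

    variances[k]        : source variance sigma_k^2
    weak_distortions[k] : D_k from the weak user's reverse water-filling
    """
    total = 0.0
    for var, d in zip(variances, weak_distortions):
        if var / d > 1.0 + power / n_w:
            # Component needs more rate than its own sub-channel offers:
            # quantize to D_k' and send the quantization error uncoded.
            d_coarse = d * (1.0 + power / n_w)
            total += d_coarse / (1.0 + power / n_s)
        else:
            # Component fits: send it uncoded with power P_m' chosen so that
            # the weak user still attains D_m.
            p_uncoded = n_w * (var / d - 1.0)
            total += var / (1.0 + p_uncoded / n_s)
    return total / len(variances)
```

With the hypothetical values $\sigma^2 = (4, 1)$, $D = (0.5, 0.5)$, $P = 3$, $N_w = 1$, $N_s = 0.5$, the strong user's average distortion is $13/42 \approx 0.31$, compared with $0.5$ under pure separation. Setting $N_s = N_w$ returns the weak user's own distortion, as expected.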

This scheme directly extends to the $K \ne M$ case. When K < M (bandwidth expansion), the extra
sub-channels will be used to send additional coded bits. For the case of K > M (bandwidth contraction),
at most M source components can be sent (wholly or partially) uncoded. In summary, we have the
following

Theorem 2: For the source-channel problem in Section II, $(D_s, D_w^*)$ is achievable, where $D_w^*$ is given
by (2) and $D_s$ is as defined below, where the $D_k$'s are given by (2), $P_k = P$, and

$$K' = \min\{|\{k : \mu < \sigma_k^2\}|, K\}.$$

[The displayed expressions for $D_s$, $L$ and the $P_k'$'s are not recoverable from the extracted text.]
B. Strong-user-optimal case

If the point-to-point separation approach is used for providing optimal fidelity to the strong user, since 
the rate of transmission is greater than the channel capacity of the weak user, the weak user will not be able 
to get any useful information. However, in this subsection we show that we can provide useful information 
to the weak user without compromising the strong user's performance. As will become clear, in this case, 
the weak user's receiver will only involve scaling the signals received on different sub-channels, i.e., it 
will be an analog receiver. An extension of the scheme in this subsection was shown in [11] to obtain 
the entire optimal distortion trade-off region for broadcasting a parallel Gaussian source over a parallel 
Gaussian broadcast channel to two receivers when one of the receivers is restricted to be a linear filter 
(but without any assumptions on the relative strengths of the channels to the receivers). The linear filter
receiver is a model for a legacy analog receiver in a transitional broadcast system which supports digital
and analog receivers.

Let the point-to-point optimal reverse water-filling solution for the strong user produce a total distortion
$D_s^*$ from a distortion allocation $D_k$, $k = 1, 2, \ldots, K$ according to (2), where $\mu$ is now chosen so that the
total rate equals the capacity $C_s = (1/2) \sum_{m=1}^{M} \log(1 + P_m/N_s)$, $P_m = P$, of the strong user's channel.

It is again helpful to consider the special case of K = M = 2 and $\sigma_1^2 > \sigma_2^2$ (see Fig. 5) before the
general case. Note that this example was also used as a starting point for presenting the scheme in [11,
Section III]. We summarize the discussion below for completeness. With $P_1 = P_2 = P$, we have

$$\frac{1}{2} \log \frac{\sigma_1^2 \sigma_2^2}{D_1 D_2} = \frac{1}{2} \log \left( \left(1 + \frac{P_1}{N_s}\right) \left(1 + \frac{P_2}{N_s}\right) \right).$$

We source code $S_1$ using successive refinement (Fig. 4(a)) such that now the bitrate of the coarse
description at distortion $D_1''$ is equal to the rate at which the first sub-channel operates,

$$\frac{1}{2} \log \frac{\sigma_1^2}{D_1''} = \frac{1}{2} \log \left( 1 + \frac{P_1}{N_s} \right).$$

Note that this is different from the previous subsection where we set the bitrate of the refinement bit
stream equal to the rate of the first sub-channel. Thus

$$\frac{1}{2} \log \frac{D_1'' \sigma_2^2}{D_1 D_2} = \frac{1}{2} \log \left( 1 + \frac{P_2}{N_s} \right).$$

Instead of sending the coarse description over the first sub-channel and the refinement bit stream on
the second, without losing optimality for the strong user, we may send $S_1$ uncoded (scaled by $\sqrt{P_1/\sigma_1^2}$)
over the first sub-channel and on the second sub-channel a Wyner-Ziv bit stream (of rate equal to that
of the refinement bit stream) which assumes that the corrupted version of $S_1$ from the first sub-channel
will be available as side information at the decoder. The optimality of this scheme follows from the no
rate-loss property of jointly Gaussian sources under Wyner-Ziv coding [12].
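The rate bookkeeping behind this substitution can be checked directly. In the sketch below (illustrative names, rates in bits), `refinement_rate` is the rate of the successive-refinement stream when the coarse layer is matched to the first sub-channel, and `wyner_ziv_rate` is the Gaussian Wyner-Ziv rate computed from the conditional variance of $S_1$ given the noisy uncoded observation. The no-rate-loss property is what justifies using the conditional variance in the second function.

```python
import math


def refinement_rate(variance, d_final, power, noise):
    """Rate to refine from D_1'' (coarse rate equal to the sub-channel rate) down to d_final."""
    d_coarse = variance / (1.0 + power / noise)
    return 0.5 * math.log2(d_coarse / d_final)


def wyner_ziv_rate(variance, d_final, power, noise):
    """Gaussian Wyner-Ziv rate with side information Y = sqrt(power/variance)*S + Z."""
    cov_sy_squared = power * variance      # Cov(S, Y)^2
    var_y = power + noise                  # Var(Y)
    cond_var = variance - cov_sy_squared / var_y
    return 0.5 * math.log2(cond_var / d_final)
```

For example, with the hypothetical values $\sigma_1^2 = 4$, $P = 3$, $N_s = 0.5$ and target $D_1 = 2/7$, both functions return 0.5 bits per sample, so the Wyner-Ziv stream fits exactly where the refinement stream did.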

Thus the weak user can now form an estimate of $S_1$ from its output $Y_1$ of the first sub-channel. It is also
possible to provide the weak user with an estimate of the second component without losing optimality
for the strong user. Let us define $P_2''$ such that the Wyner-Ziv bit stream can be sent using this power,
superposition coded with the bit stream for $S_2$ which uses the rest of the power $P_2 - P_2''$. The decoding
order is first the bit stream for $S_2$, followed by the Wyner-Ziv bit stream. Thus

$$\frac{1}{2} \log \frac{\sigma_2^2}{D_2} = \frac{1}{2} \log \left( 1 + \frac{P_2 - P_2''}{P_2'' + N_s} \right)$$

and

$$\frac{1}{2} \log \frac{D_1''}{D_1} = \frac{1}{2} \log \left( 1 + \frac{P_2''}{N_s} \right).$$



We can use dirty-paper coding of Gel'fand and Pinsker [13] and Costa [14] instead of superposition coding
to achieve the same rates. Here the channel codeword for the bit stream of $S_2$ is treated as non-causal side
information available at the encoder when channel encoding the Wyner-Ziv bit stream. Fig. 4(b) shows
this setup which uses a combination of dirty-paper coding and Wyner-Ziv coding. Note that the decoder
does not need to decode the bit stream for $S_2$ in order to decode the Wyner-Ziv bit stream. This allows
us to send $S_2$ uncoded (scaled by $\sqrt{(P_2 - P_2'')/\sigma_2^2}$) without impacting the optimality for the strong user
(Fig. 4(c)). The weak user can now form an estimate of $S_2$ from $Y_2$.

Fig. 4. Strong-user-optimal case (also see [11, Fig. 2]): (a) separation scheme showing successive refinement (SR) and superposition coding,
(b) separation scheme with Wyner-Ziv (W-Z) code and dirty-paper coding (DPC), (c) the hybrid digital-analog scheme.
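The claim that the two layers on the second sub-channel lose no rate can be verified with a few lines. The sketch below (illustrative names, rates in bits) computes the rate of the layer decoded first, which treats the other layer as noise, and the rate of the second layer, which sees a clean channel either after interference cancellation or, by Costa's result [14], through dirty-paper coding against the first layer.

```python
import math


def layered_rates(total_power, inner_power, noise):
    """Rates of two superposed Gaussian layers sharing total_power.

    outer: uses total_power - inner_power and is decoded treating the inner layer as noise.
    inner: uses inner_power and sees only the channel noise.
    """
    outer = 0.5 * math.log2(1.0 + (total_power - inner_power) / (inner_power + noise))
    inner = 0.5 * math.log2(1.0 + inner_power / noise)
    return outer, inner


if __name__ == "__main__":
    outer, inner = layered_rates(3.0, 0.5, 0.5)
    # The two rates add up to the full capacity of the sub-channel.
    print(outer + inner, 0.5 * math.log2(1.0 + 3.0 / 0.5))
```

Because the sum always equals $(1/2)\log(1 + P_2/N_s)$, any split of $P_2$ into $P_2''$ and $P_2 - P_2''$ preserves the total rate available to the strong user.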

Again, we can easily extend the above intuition to K = M > 2. Let L be the number of source
components and sub-channels such that $(1/2) \log(\sigma_k^2/D_k) > (1/2) \log(1 + P_k/N_s)$. Since the $\sigma_k^2$ are
monotonically decreasing, these will be the first L components. For these components, we define $D_k''$ such
that $(1/2) \log(\sigma_k^2/D_k'') = (1/2) \log(1 + P_k/N_s)$. For the rest of the sub-channels (m > L) we define
$P_m''$ by $(1/2) \log(\sigma_m^2/D_m) = (1/2) \log(1 + (P_m - P_m'')/(P_m'' + N_s))$. The first L source components are
sent uncoded scaled by $\sqrt{P_k/\sigma_k^2}$ on their corresponding sub-channels, and the rest of the source components
are sent uncoded scaled by $\sqrt{(P_k - P_k'')/\sigma_k^2}$ on their corresponding sub-channels. The first L source
components are Wyner-Ziv source coded at rates of $(1/2) \log(D_k''/D_k)$ assuming the availability at the
decoder (strong user) of the noise corrupted versions sent over the corresponding sub-channels. These
source coded bits are sent using dirty-paper coding over the rest of the sub-channels m > L. The resulting
distortion for the weak user is

$$D_w = \frac{1}{K} \left[ \sum_{k=1}^{L} \frac{\sigma_k^2}{1 + P_k/N_w} + \sum_{m=L+1}^{K} \frac{\sigma_m^2}{1 + \frac{P_m - P_m''}{P_m'' + N_w}} \right],$$

which is strictly less than $(1/K) \sum_{k=1}^{K} \sigma_k^2$, the distortion for the weak user in the separation approach under
which no information is decodable by this user.

Fig. 5. Strong-user-optimal case: The schematic diagram showing the allocation for a K = M = 2 example. The optimal coded separation
scheme for the strong user may be thought of as sending the coarse-layer bits from a successive refinement source coding of the first source
component $S_1$ alone over the first sub-channel, and source coded bits of the second source component $S_2$ and the refinement bits of $S_1$
over the second sub-channel. However, none of these bits are decodable by the weak user. We may provide useful information without
compromising the strong user's performance by (i) sending $S_1$ uncoded (scaled) over the first sub-channel and (ii) sending $S_2$ uncoded
(scaled) over the second sub-channel and bits carrying refinement information on $S_1$ using Gel'fand and Pinsker's (dirty-paper) channel
coding where the transmission of $S_2$ acts as Gaussian side-information at the transmitter. Dirty-paper coding ensures that the transmission of
$S_2$ does not affect the rate of transmission of the refinement bits. The bits carrying the refinement information are produced using Wyner-Ziv
coding where the noisy observation of $S_1$ over the first sub-channel acts as side-information at the decoder. While the weak user will be
unable to decode the refinement information, it benefits from the two uncoded transmissions.
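The strong-user-optimal construction for K = M can likewise be traced numerically. The sketch below is an illustration under stated assumptions: equal power `power` on every sub-channel, the strong user's reverse water-filling allocation `strong_distortions` supplied as input, and the dirty-paper coded signal modeled as Gaussian noise of power $P_m''$ independent of the uncoded signal when seen by the weak user's linear estimator. All names are hypothetical.

```python
import math


def weak_user_distortion(variances, strong_distortions, power, n_s, n_w):
    """Return (D_w, wyner_ziv_bits, dirty_paper_bits) for the strong-user-optimal scheme."""
    total = 0.0
    wz_rate = 0.0    # Wyner-Ziv bits generated by components sent uncoded at full power
    dpc_rate = 0.0   # rate carried by dirty-paper codes on the remaining sub-channels
    for var, d in zip(variances, strong_distortions):
        if var / d > 1.0 + power / n_s:
            # Uncoded at full power; the strong user refines via Wyner-Ziv bits.
            d_side = var / (1.0 + power / n_s)
            wz_rate += 0.5 * math.log2(d_side / d)
            total += var / (1.0 + power / n_w)
        else:
            # Split the power: P_m'' for dirty-paper coded bits, the rest uncoded.
            p_dpc = (power + n_s) * d / var - n_s
            dpc_rate += 0.5 * math.log2(1.0 + p_dpc / n_s)
            total += var / (1.0 + (power - p_dpc) / (p_dpc + n_w))
    return total / len(variances), wz_rate, dpc_rate
```

With the hypothetical values $\sigma^2 = (4, 1)$, $D = (2/7, 2/7)$, $P = 3$, $N_s = 0.5$, $N_w = 1$, the weak user's average distortion is $0.6875$, well below the $2.5$ it would incur with no decodable information, and the Wyner-Ziv and dirty-paper rates both equal 0.5 bits, confirming that the bit budget balances.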

This scheme also directly extends to the bandwidth expansion (K < M) and bandwidth contraction
(K > M) scenarios. To summarize, we can state the following

Theorem 3: The distortion pair $(D_s^*, D_w)$ is achievable for the source-channel coding problem in
Section II, where $D_s^*$, the point-to-point optimal distortion for the strong user, and $D_w$ are as follows.

$$D_s^* = \frac{1}{K} \sum_{k=1}^{K} D_k, \qquad D_k = \begin{cases} \mu, & \text{if } \mu < \sigma_k^2, \\ \sigma_k^2, & \text{if } \mu \ge \sigma_k^2, \end{cases} \qquad (3)$$

where $\mu$ is chosen such that the total rate $(1/2) \sum_{k=1}^{K} \log(\sigma_k^2/D_k)$ equals the capacity $C_s$ of the strong
user's channel, $C_s = (1/2) \sum_{m=1}^{M} \log(1 + P_m/N_s)$, where $P_1 = P_2 = \ldots = P_M = P$.
Let

$$K' = \min\{|\{k : \mu < \sigma_k^2\}|, K\}.$$

[The displayed expressions defining $L$ and the $P_k''$'s, and the resulting expression for $D_w$, are not recoverable from the extracted text.]
Fig. 6. An achievable trade-off: The schematic diagram shows an allocation for a K = M = 2 example. The first source component
$S_1$ is sent in three different ways: (i) a coarse layer source codeword which will be decoded by both users, (ii) an uncoded version of
the quantization error which the weak user will estimate from its noisy observation, and (iii) Wyner-Ziv bits on $S_1$ which only the strong
user decodes. In decoding the Wyner-Ziv codeword, the strong user uses as side-information a linear estimate of $S_1$ using (i) and its noisy
observation of (ii). The quantization error (ii) is sent uncoded (scaled) over the first sub-channel using all the power allocated to this
sub-channel. Over the second sub-channel, the coarse layer bits from (i) above are sent using a Gaussian channel code using part of the
power allocated to this sub-channel. This is meant to be decoded by both users treating the rest of the signals sent over this sub-channel as
noise. Using part of the leftover power, the second source is sent uncoded (scaled). Since the codeword carrying bits from (i) is assumed
to be successfully decoded by both users, they may estimate the second source component assuming only the rest of the power used in
this sub-channel and the channel noise as the disturbance affecting this transmission. The leftover power in this sub-channel is used to send
the Wyner-Ziv bits from (iii). This is done using Gel'fand-Pinsker's (dirty-paper) coding treating the scaled version of the second source
component sent over this sub-channel as side-information (at the transmitter).



IV. An achievable trade-off 

We may also trade off the quality of reproductions at the two users without being optimal for either.
Clearly, time sharing between the two achievable extreme points is a possibility. It is often possible to 
do better. A natural strategy suggested by the above discussion is to combine the schemes for the weak- 
and strong-user-optimal cases. 

Again, we will first consider a K = M = 2 example with $\sigma_1^2 > \sigma_2^2$. Let us suppose that the power
allocations to the two sub-channels are $P_1$ and $P_2$ such that $P_1 + P_2 = 2P$. Consider Fig. 6. The first
source component $S_1$ is sent in three different ways:
(i) A (coarse layer) source codeword is formed which is meant to be decoded by both users. The source
coded bits are sent over the second sub-channel using a Gaussian channel code utilizing part of the
power $P_2$ allocated to this sub-channel. This is meant to be decoded by both users treating the rest
of the signals sent over this sub-channel as noise. Let us denote the quantized version of $S_1^n$ by $S_1'^n$
and its MSE by $D_1'$. The rate of this coarse layer source codebook is then

$$\frac{1}{2} \log \frac{\sigma_1^2}{D_1'}. \qquad (4)$$
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(ii) An uncoded (scaled) version of the quantization error resulting from the source coding in (i) is 
transmitted. This transmission occurs over the first sub-channel using all the power Pi allocated to 
this sub-channel, i.e., 

1^ 




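The weak user estimates $S_1^n$ from the codeword ${S_1'}^n$ in (i) and its noisy observation $Y_{w,1}^n$ of the quantization 
error over the first sub-channel using a linear estimator 

$$\hat{S}_{w,1}^n = {S_1'}^n + \frac{\sqrt{P_1 D_1'}}{P_1 + N_w}\, Y_{w,1}^n.$$

The resulting MSE is 

$$\frac{D_1'}{1 + \frac{P_1}{N_w}}. \tag{5}$$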



(iii) Wyner-Ziv bits on $S_1^n$ are sent intended for the strong user alone. In decoding the Wyner-Ziv 
codeword, the strong user uses as side-information a linear estimate of $S_1^n$ it forms using (i) and 
its noisy observation $Y_{s,1}^n$ of (ii) in a manner similar to the weak user's estimate above. Let us 
denote the strong user's estimate of $S_1^n$ by ${S_1''}^n$ and the MSE of this estimate by $D_1''$. We have 

$${S_1''}^n = {S_1'}^n + \frac{\sqrt{P_1 D_1'}}{P_1 + N_s}\, Y_{s,1}^n, \tag{6}$$

and 

$$D_1'' = \frac{D_1'}{1 + \frac{P_1}{N_s}}. \tag{7}$$

Then, by Wyner-Ziv's theorem, the bit rate needed to achieve a distortion of $D_1$ on the source $S_1^n$ 
at the strong user with ${S_1''}^n$ acting as side-information is (see footnote 2) 

$$\frac{1}{2}\log\frac{D_1''}{D_1}. \tag{8}$$

The transmission of these Wyner-Ziv bits occurs over the second sub-channel as described below. 
The transmission over the second sub-channel is meant to deliver the coarse layer source codeword bits 
about $S_1$ and an estimate of $S_2$ to the weak user, and in addition to these, the Wyner-Ziv bits about $S_1$ 
to the strong user. This is accomplished as follows: 
(a) The source codeword bits from (i) are transmitted using a Gaussian channel code utilizing power 
$P_2 - P_2'$ (a part of the total power $P_2$ allocated to this sub-channel). This is decoded by both users 
treating the rest of the signals sent over this sub-channel as noise. The decoding at the weak user 
presents the bottleneck to the rate at which bits may be delivered. Hence, to meet the rate required 
by the coarse layer codebook in (i), we must have $D_1'$, $P_2$, and $P_2'$ satisfy 

$$\frac{1}{2}\log\frac{\sigma_1^2}{D_1'} \le \frac{1}{2}\log\left(1 + \frac{P_2 - P_2'}{P_2' + N_w}\right). \tag{9}$$


Footnote 2: However, note that by (6), our side-information ${S_1''}^n$ is not $S_1^n$ corrupted by a memoryless Gaussian disturbance as a classical statement 
of Wyner-Ziv's theorem would require. All we are guaranteed is that the side-information has an MSE of $D_1''$ with respect to the source. A 
simple extension of the achievability proof can handle this situation: we may invoke the achievability part with $S_1^n$ and ${S_1''}^n$ acting as the 
(vector) symbols; see, for instance, [5, Appendix IV]. 
(b) The second source component $S_2$ is sent uncoded (scaled) using power $P_2' - P_2''$ (a part of the power 
$P_2'$ left over after (a)), i.e., we send $\sqrt{(P_2' - P_2'')/\sigma_2^2}\, S_2^n$. Since the codeword from (a) is assumed 
to be successfully decoded by both users, they may strip it off their received signals $Y_{w,2}^n$ and $Y_{s,2}^n$ 
and estimate the second source component assuming only the rest of the power used in this sub-channel 
and the channel noise as the disturbance affecting this transmission. Both users employ 
linear estimators. Let us denote the signals after the codeword from (a) has been stripped off by 
$\tilde{Y}_{w,2}^n$ and $\tilde{Y}_{s,2}^n$ at the weak user and the strong user, respectively. Then, the estimates of $S_2^n$ are 

$$\hat{S}_{j,2}^n = \frac{\sqrt{\sigma_2^2\,(P_2' - P_2'')}}{P_2' + N_j}\, \tilde{Y}_{j,2}^n, \quad j \in \{s, w\}.$$

The MSE $D_2$ on the second source component incurred by the strong user is given by 

$$\frac{1}{2}\log\frac{\sigma_2^2}{D_2} = \frac{1}{2}\log\left(1 + \frac{P_2' - P_2''}{P_2'' + N_s}\right). \tag{10}$$

Similarly, the weak user incurs an average distortion on the second source component of 

$$\frac{\sigma_2^2}{1 + \frac{P_2' - P_2''}{P_2'' + N_w}}.$$

(c) Finally, the Wyner-Ziv bits from (iii) are transmitted intended for the strong user with the leftover 
power of $P_2''$. Let us recall that we have assumed that the codeword from (a) is successfully decoded 
by the strong user and stripped off its received signal. The only other disturbances affecting the 
transmission of the Wyner-Ziv bits are the memoryless Gaussian noise in the channel and the scaled 
transmission of the memoryless Gaussian $S_2$ in (b), both of which are independent of each other and 
of the Wyner-Ziv bits being sent. Also, the disturbance $S_2$ is known to the transmitter non-causally. 
This is precisely the setting of Gel'fand-Pinsker or dirty-paper coding, by which a rate equal to the 
capacity of the channel in which only the memoryless Gaussian noise is present can be achieved. This 
rate must be large enough to support the Wyner-Ziv bits whose rate is (8). Thus, $P_2''$, $D_1''$ and $D_1$ 
must satisfy 

$$\frac{1}{2}\log\frac{D_1''}{D_1} \le \frac{1}{2}\log\left(1 + \frac{P_2''}{N_s}\right). \tag{11}$$

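To summarize, the decoders can achieve distortions of 

$$D_s = \frac{D_1 + D_2}{2}, \text{ and}$$

$$D_w = \frac{1}{2}\left(\frac{D_1'}{1 + \frac{P_1}{N_w}} + \frac{\sigma_2^2}{1 + \frac{P_2' - P_2''}{P_2'' + N_w}}\right),$$

for every choice of the non-negative power parameters $P_1$, and $P_2 \ge P_2' \ge P_2''$ which satisfy the sum 
power constraint $P_1 + P_2 = 2P$, and non-negative distortion parameters $D_1 \le D_1'' \le D_1' \le \sigma_1^2$, and 
$D_2 \le \sigma_2^2$, provided they satisfy the conditions (5), (7), (9), (10), and (11). 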
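The $K = M = 2$ construction can be explored numerically. The following is a minimal sketch, not the authors' code: it assumes that the coarse-layer and Wyner-Ziv rate constraints are met with equality, that the strong user's side-information MSE equals the coarse MSE divided by $1 + P_1/N_s$, and that all function and argument names are hypothetical.

```python
def two_component_tradeoff(sigma1_sq, sigma2_sq, P, N_w, N_s, P1, P2p, P2pp):
    """Return (D_s, D_w) for the K = M = 2 hybrid scheme (illustrative sketch).

    P1        : power on sub-channel 1 (uncoded quantization error)
    P2p, P2pp : P2' and P2'', with P2 >= P2' >= P2'' >= 0 and P2 = 2P - P1
    Assumption: both digital rate constraints hold with equality.
    """
    P2 = 2.0 * P - P1
    if P1 < 0.0 or not (0.0 <= P2pp <= P2p <= P2):
        raise ValueError("need P1 >= 0 and P2 >= P2' >= P2'' >= 0")

    # Coarse layer, decoded by both users at the weak user's rate:
    # (1/2)log(sigma1^2 / D1') = (1/2)log(1 + (P2 - P2') / (P2' + N_w))
    D1_coarse = sigma1_sq * (P2p + N_w) / (P2 + N_w)

    # Strong user's side information: coarse codeword plus uncoded error
    D1_side = D1_coarse / (1.0 + P1 / N_s)

    # Wyner-Ziv layer carried at the dirty-paper rate (1/2)log(1 + P2''/N_s)
    D1 = D1_side * N_s / (P2pp + N_s)

    # Second component: uncoded with power P2' - P2''; the Wyner-Ziv
    # codeword (power P2'') and the channel noise act as disturbance
    analog_power = P2p - P2pp
    D2_s = sigma2_sq / (1.0 + analog_power / (P2pp + N_s))
    D2_w = sigma2_sq / (1.0 + analog_power / (P2pp + N_w))

    D_s = 0.5 * (D1 + D2_s)
    D_w = 0.5 * (D1_coarse / (1.0 + P1 / N_w) + D2_w)
    return D_s, D_w
```

Sweeping `P1`, `P2p`, and `P2pp` over a grid and keeping the lower-left boundary of the resulting points gives a rough picture of the trade-off for this example.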



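Generalizing the above, we have the following achievable trade-off. 

Theorem 4: Let $L \in \{0, 1, \ldots, \min(K, M)\}$ and $K' \in \{L, L+1, \ldots, \min(K, M)\}$. Also, let $P_1, P_2, \ldots, P_M$, 
$P'_{L+1}, P'_{L+2}, \ldots, P'_M$, $P''_{L+1}, P''_{L+2}, \ldots, P''_{K'}$, $D_1, D_2, \ldots, D_K$, $D'_1, D'_2, \ldots, D'_L$, $D'_{K'+1}, D'_{K'+2}, \ldots, D'_K$, 
$D''_1, D''_2, \ldots, D''_L$ be non-negative such that the following conditions are satisfied 

$$\sum_{m=1}^{M} P_m \le MP, \tag{12}$$

$$\begin{aligned}
P'_m &\le P_m, & m &= L+1, L+2, \ldots, M,\\
P''_m &\le P'_m, & m &= L+1, L+2, \ldots, K',\\
D_k \le D''_k \le D'_k &\le \sigma_k^2, & k &= 1, 2, \ldots, L,\\
D_k &\le \sigma_k^2, & k &= L+1, L+2, \ldots, K',\\
D_k \le D'_k &\le \sigma_k^2, & k &= K'+1, K'+2, \ldots, K.
\end{aligned} \tag{13}$$

The following $(D_s, D_w)$ is achievable 

$$D_s = \frac{1}{K}\left(\sum_{k=1}^{K} D_k\right),$$

$$D_w = \frac{1}{K}\left(\sum_{k=1}^{L} \frac{D'_k}{1 + \frac{P_k}{N_w}} + \sum_{k=L+1}^{K'} \frac{\sigma_k^2}{1 + \frac{P'_k - P''_k}{P''_k + N_w}} + \sum_{k=K'+1}^{K} D'_k\right),$$

if the following conditions are satisfied 

$$\begin{aligned}
\sum_{k=1}^{L} \log\frac{\sigma_k^2}{D'_k} + \sum_{k=K'+1}^{K} \log\frac{\sigma_k^2}{D'_k} &\le \sum_{m=L+1}^{M} \log\left(1 + \frac{P_m - P'_m}{P'_m + N_w}\right),\\
\sum_{k=1}^{L} \log\frac{D''_k}{D_k} + \sum_{k=K'+1}^{K} \log\frac{D'_k}{D_k} &\le \sum_{m=L+1}^{K'} \log\left(1 + \frac{P''_m}{N_s}\right) + \sum_{m=K'+1}^{M} \log\left(1 + \frac{P'_m}{N_s}\right).
\end{aligned} \tag{15}$$

The proof is relegated to Appendix III. 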
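The two rate conditions of the theorem compare the bits that must be delivered (common and strong-user-only) against what the sub-channels can carry. The sketch below checks them for a candidate parameter set. It is an illustration under assumptions: the required and available rates are taken from the proof sketch in Appendix III, indices are 0-based, and every name is hypothetical.

```python
import math


def theorem4_rates_ok(sigma_sq, D, D_coarse, D_side, P, P_p, P_pp,
                      L, K_prime, N_w, N_s, tol=1e-9):
    """Check the common-rate and refinement-rate conditions (sketch).

    sigma_sq[k], D[k]     : source variance and strong-user distortion
    D_coarse[k] (D'_k)    : used for k < L and k >= K_prime
    D_side[k]   (D''_k)   : used for k < L
    P[m], P_p[m], P_pp[m] : P_m, P'_m (m >= L), P''_m (L <= m < K_prime)
    Entries outside the stated index ranges are never read.
    """
    K, M = len(sigma_sq), len(P)

    # Bits needed by both users: coarse descriptions
    common_needed = sum(math.log(sigma_sq[k] / D_coarse[k]) for k in range(L))
    common_needed += sum(math.log(sigma_sq[k] / D_coarse[k])
                         for k in range(K_prime, K))
    # Common rate is limited by the weak user; P'_m acts as interference
    common_avail = sum(math.log(1.0 + (P[m] - P_p[m]) / (P_p[m] + N_w))
                       for m in range(L, M))

    # Bits needed by the strong user only: Wyner-Ziv and refinement layers
    refine_needed = sum(math.log(D_side[k] / D[k]) for k in range(L))
    refine_needed += sum(math.log(D_coarse[k] / D[k])
                         for k in range(K_prime, K))
    # Dirty-paper coded sub-channels, then superposition coded sub-channels
    refine_avail = sum(math.log(1.0 + P_pp[m] / N_s)
                       for m in range(L, K_prime))
    refine_avail += sum(math.log(1.0 + P_p[m] / N_s)
                        for m in range(K_prime, M))

    return (common_needed <= common_avail + tol
            and refine_needed <= refine_avail + tol)
```

The base of the logarithm does not matter here because both sides of each inequality use the same one.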



In general, the above optimization problem appears to be computationally challenging for large values 
of K and M. However, simplification is possible for the important special case of memoryless sources 
and channels with bandwidth expansion and bandwidth contraction. This is taken up in the next section. 

V. Specialization to memoryless sources and channels with bandwidth mismatch 

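A special case of the problem is when the source is also memoryless, but has a bandwidth different 
from the channel. If we define the degree of mismatch by $\alpha = M/K$, we have 

Theorem 5: For the special case of the problem in Section II with $\sigma_1^2 = \sigma_2^2 = \ldots = \sigma_K^2 = \sigma^2$, the 
following $(D_s, D_w)$ trade-off is achievable: 

• For $\alpha < 1$ (bandwidth contraction) 

$$\{(D_s^c(\lambda, \gamma), D_w^c(\lambda, \gamma)) : 0 \le \lambda \le 1,\ 0 \le \gamma \le 1\},$$

where 

$$D_w^c(\lambda, \gamma) = \alpha\sigma^2\,\frac{\lambda\gamma P + N_w}{\lambda P + N_w} + (1 - \alpha)\,\sigma^2\left(\frac{\lambda P + N_w}{P + N_w}\right)^{\frac{\alpha}{1-\alpha}}, \text{ and}$$

$$D_s^c(\lambda, \gamma) = \alpha\sigma^2\,\frac{\lambda\gamma P + N_s}{\lambda P + N_s} + (1 - \alpha)\,\sigma^2\left(\frac{P + N_w}{\lambda P + N_w}\cdot\frac{\lambda\gamma P + N_s}{N_s}\right)^{-\frac{\alpha}{1-\alpha}}.$$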
Fig. 7. Memoryless source over memoryless channel with bandwidth contraction: Schematic diagram showing an allocation for bandwidth 
expansion factor $\alpha = 1/2$. Note that we need not use Wyner-Ziv coding and may use successive refinement source coding as shown. 

Fig. 8. Memoryless source over memoryless channel with bandwidth expansion: Schematic diagram showing an allocation for bandwidth 
expansion factor $\alpha = 2$. Note that we need not use Gel'fand-Pinsker's (dirty-paper) coding and may use superposition channel coding as 
shown. 



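• For $\alpha > 1$ (bandwidth expansion) 

$$\{(D_s^e(\lambda, \gamma), D_w^e(\lambda, \gamma)) : 0 \le \lambda \le 1,\ 0 \le \gamma \le 1\},$$

where 

$$D_w^e(\lambda, \gamma) = \frac{\sigma^2}{1 + \frac{\alpha\gamma P}{N_w}}\left(\frac{(1-\lambda)\frac{\alpha(1-\gamma)}{\alpha-1}P + N_w}{\frac{\alpha(1-\gamma)}{\alpha-1}P + N_w}\right)^{\alpha-1}, \text{ and}$$

$$D_s^e(\lambda, \gamma) = \frac{\sigma^2}{1 + \frac{\alpha\gamma P}{N_s}}\left(\frac{(1-\lambda)\frac{\alpha(1-\gamma)}{\alpha-1}P + N_w}{\frac{\alpha(1-\gamma)}{\alpha-1}P + N_w}\cdot\frac{N_s}{(1-\lambda)\frac{\alpha(1-\gamma)}{\alpha-1}P + N_s}\right)^{\alpha-1}.$$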
We prove this as a special case of Theorem 4 in Appendix II. 
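For the bandwidth contraction case, the trade-off can be traced numerically. The sketch below is an illustration, not the authors' implementation: it assumes the parameter choice of Appendix II ($P_m = P$, $P'_m = \lambda P$, $P''_m = \lambda\gamma P$) with the rate conditions of Theorem 4 met with equality, and the function name and arguments are hypothetical.

```python
def contraction_tradeoff(lam, gam, alpha, P, N_w, N_s, sigma_sq=1.0):
    """Return (D_s, D_w) for bandwidth contraction, alpha = M/K < 1 (sketch).

    lam, gam : the parameters lambda and gamma, both in [0, 1]
    Assumption: common and refinement rate constraints hold with equality.
    """
    if not 0.0 < alpha < 1.0:
        raise ValueError("bandwidth contraction requires 0 < alpha < 1")
    if not (0.0 <= lam <= 1.0 and 0.0 <= gam <= 1.0):
        raise ValueError("lam and gam must lie in [0, 1]")

    exponent = alpha / (1.0 - alpha)  # M / (K - M)

    # K - M components are purely digital: coarse layer for both users,
    # refinement layer (dirty-paper coded) for the strong user only
    D_coarse = sigma_sq * ((lam * P + N_w) / (P + N_w)) ** exponent
    D_fine = D_coarse * (N_s / (lam * gam * P + N_s)) ** exponent

    # M components are sent uncoded with power lam * (1 - gam) * P
    analog_w = sigma_sq * (lam * gam * P + N_w) / (lam * P + N_w)
    analog_s = sigma_sq * (lam * gam * P + N_s) / (lam * P + N_s)

    D_w = alpha * analog_w + (1.0 - alpha) * D_coarse
    D_s = alpha * analog_s + (1.0 - alpha) * D_fine
    return D_s, D_w
```

With $\gamma = 0$ and $\lambda P + N_w = N_w^{1-\alpha}(P + N_w)^{\alpha}$, the weak user's distortion in this sketch equals the point-to-point optimum $\sigma^2 (N_w/(P+N_w))^{\alpha}$, which is a convenient sanity check.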

For bandwidth contraction, we use only successive refinement source coding and Gel'fand-Pinsker 
channel coding; see Fig. 7. In the bandwidth expansion case, only Wyner-Ziv coding and superposition 
coding are used (Fig. 8). As pointed out earlier, many researchers have investigated this special case. 
For the bandwidth expansion case, an almost identical scheme (using Wyner-Ziv coding and superposition 
coding) was presented in [5]. The trade-off expression above, but with $\gamma = 1/\alpha$ (corresponding to a flat 
power allocation), appears in [5, Theorem 2]. However, as shown in Fig. 9(b) and (c), the extra flexibility 
from non-flat power allocations can lead to slight gains. The bandwidth contraction case discussed above 
is new. The following remarks on the extreme points of these trade-offs are in order. 

• At the weak-user-optimal points, our achievable schemes under bandwidth contraction and bandwidth 
expansion reduce to the schemes proposed by Mittal and Phamdo [4]. 

• At the strong-user-optimal point under bandwidth expansion, our scheme reduces to the systematic 
lossy source-channel codes of Shamai, Verdu, and Zamir [3]. 

• At the strong-user-optimal points, the achievable scheme is strictly better than the solution offered 
by Mittal and Phamdo in [4]. The gap can be computed explicitly to be 

7 I , and 

(bandwidth expansion) ^ ^ . 

+ (1 + ^) 

As pointed out above, at the weak-user-optimal point, the schemes coincide. For the boundary 
points in between the strong-user-optimal and weak-user-optimal points, an explicit computation 
is cumbersome, but numerical computation over a wide range of settings suggests that the achievable 
scheme strictly outperforms the schemes of Mittal and Phamdo. Fig. 9 shows a comparison of the 
trade-offs achieved by our scheme with those of Mittal and Phamdo [4] for a few examples. 
Outer bounds: Only one non-trivial outer bound is available in the literature for this problem. It is due 
to Reznic, Feder, and Zamir [5] who developed it for the case of memoryless source and channel under 
bandwidth expansion. This bound, however, does not match the best available inner bound described above. 
The same bounding technique can be used to derive outer bounds for the parallel source-channel problem 
considered here. However, it does not always lead to a non-trivial bound. For instance, for memoryless 
source and channel with bandwidth contraction, the technique yields the trivial bound (resulting from 
considering the point-to-point source-channel problems involving either the weak user or the strong user 
alone). 



(bandwidth contraction) 



VI. Conclusion 

We have presented a hybrid digital-analog scheme for the problem of sending a parallel Gaussian 
source over a white Gaussian broadcast channel which potentially has a bandwidth mismatch with the 
source. We used the concepts of successive refinement and Wyner-Ziv source coding, and superposition 
and dirty paper channel coding to show that without compromising the point-to-point optimal performance 
of either the weak or strong user, we can strictly improve the performance of the other user over what the 
conventional separation approach offers. We also showed how to achieve better trade-offs when neither 
user is point-to-point optimal. While we do not have a converse for our scheme, the achievable points 
are the best available. When specialized to the case of memoryless sources and channels with bandwidth 
mismatch, our scheme matches or in some cases outperforms the best known schemes. 

Only an achievable trade-off is available for two digital receivers with different channels. We believe 
that the limitation is primarily due to the lack of good outer bounds on the region of distortions that can 
be supported simultaneously. Even in simple cases like when the source and channel are memoryless, but 
Fig. 9. Comparison of distortion trade-offs achieved by our scheme with that of Mittal and Phamdo: (a) Bandwidth contraction. $\sigma^2 = 1$, 
$P/N_s = 20$ dB, $P/N_w = 0$ dB, $\alpha = 0.5$. The best scheme suggested by Mittal and Phamdo was chosen for comparison [4, Fig. 14]. The 
dashed lines are drawn at the weak- and strong-user-optimal distortions and thus give the trivial outer bound to the trade-off region. The 
strong-user-optimal points appear to coincide, but there is a small gap which is not visible at the scale of this plot. The weak-user-optimal 
points indeed coincide where both schemes reduce to the same scheme. The performance of the best separation-based scheme is also shown 
for comparison. 

with mismatching bandwidths, a tight result is not available. In fact, the best available outer bound for 
the case where the source bandwidth is larger than the channel bandwidth is the trivial outer bound which 
considers the receivers separately. It is also not clear if codes with more structure can be used to obtain 
better trade-offs. These could be subjects of further investigation. 
Fig. 9. Comparison of distortion trade-offs achieved by our scheme with that of Mittal and Phamdo: (b) Bandwidth expansion. $\sigma^2 = 1$, 
$P/N_s = 20$ dB, $P/N_w = 0$ dB, $\alpha = 2.0$. Again the best scheme of Mittal and Phamdo for this setting was chosen for comparison [4, 
Fig. 12]. The dashed lines are drawn at the weak- and strong-user-optimal distortions and thus give the trivial outer bound to the trade-off 
region. The strong-user-optimal points appear to coincide, but there is a small gap which is not visible at the scale of this plot. The 
weak-user-optimal points indeed coincide where the schemes reduce to the same scheme. The slight improvement over [5] from allowing non-flat 
power allocation is visible. The performance of the best separation-based scheme is shown for comparison. 



Appendix I 
Proof of Proposition [H 

We will use the notation introduced in Section II with $\sigma_1^2 = \sigma_2^2 = \ldots = \sigma_K^2 = \sigma^2$. The proposition is 
a consequence of the following two facts: (i) the Gaussian source $\{S_k, k = 1, \ldots, K\}$ is successively 
refinable [7], and (ii) there is an operating point in the degraded message set rate region of the white 
Gaussian broadcast channel in which each user's rate is within 1 bit of its point-to-point capacity. To see 
the second fact, let us recall that the boundary of the degraded message set rate region (for each 
sub-channel) of the Gaussian broadcast channel is given by the set of rates $\{(R_{\text{base}}(\beta), R_{\text{refine}}(\beta)),\ \beta \in [0, 1]\}$, 
Fig. 9. Comparison of distortion trade-offs achieved by our scheme with those of Mittal and Phamdo: (c) Bandwidth expansion. $\sigma^2 = 1$, 
$P/N_s = 4$ dB, $P/N_w = 0$ dB, $\alpha = 1.5$. Two schemes of Mittal and Phamdo (systems 2 and 3) together give the best performance of 
all the new schemes proposed in [4, Fig. 13]. The dashed lines are drawn at the weak- and strong-user-optimal distortions and thus give 
the trivial outer bound to the trade-off region. The gap between the strong-user-optimal points is visible in this plot. The weak-user-optimal 
points coincide. The slight improvement over [5] from allowing non-flat power allocation is again visible. The performance of the best 
separation-based scheme is shown for comparison. 



where the rate of the message decoded by both users is (in bits per symbol) 

$$R_{\text{base}}(\beta) = \frac{1}{2}\log_2\left(1 + \frac{(1-\beta)P}{\beta P + N_w}\right),$$

and the rate of the message decoded only by the strong user is 

$$R_{\text{refine}}(\beta) = \frac{1}{2}\log_2\left(1 + \frac{\beta P}{N_s}\right).$$

And, for a given $\beta \in [0, 1]$, the overall rate delivered to the users is 

$$R_w(\beta) = R_{\text{base}}(\beta),$$
$$R_s(\beta) = R_{\text{base}}(\beta) + R_{\text{refine}}(\beta).$$

In order to show fact (ii), it is enough to show that there is a $\beta \in [0, 1]$ such that 

$$R_w(\beta) + 1 \ge R_w^* = R_w(0), \text{ and}$$
$$R_s(\beta) + 1 \ge R_s^* = R_s(1),$$

where $R_w^*$ and $R_s^*$ are the point-to-point capacities of the weak and strong users respectively. Simplifying 
the above two conditions, we get 

$$\frac{3N_w}{P} \ge \beta \ge \frac{N_w(P + N_s) - 4N_s(P + N_w)}{P\,(3P + 4N_w - N_s)}.$$

Observing that the left-hand side is always larger than the right-hand side and that the right-hand side is 
always less than 1, we can conclude that such a $\beta \in [0, 1]$ always exists. By choosing this operating 
point $(R_{\text{base}}(\beta), R_{\text{refine}}(\beta))$ for the degraded message set broadcast channel code, and using the optimal 
layered (successive refinement) source code, the distortion $D_j$ achieved by user $j$, $j \in \{s, w\}$, 
satisfies 

$$\frac{K}{2}\log_2\frac{\sigma^2}{D_j} = M R_j(\beta) \ge M(R_j^* - 1) = \frac{K}{2}\log_2\frac{\sigma^2}{D_j^*} - M.$$

Thus, since $M/K$ is the ratio of the channel bandwidth to the source bandwidth, we have proved that 

$$\frac{1}{2}\log_2\frac{D_j}{D_j^*} \le \frac{\text{BW}_{\text{channel}}}{\text{BW}_{\text{source}}}.$$
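The existence of an operating point within one bit of both point-to-point capacities can be checked numerically. The sketch below is illustrative only: it assumes the superposition-coding rates with a fraction $\beta$ of the power given to the refinement message, picks $\beta = \min(1, 3N_w/P)$ (the largest value allowed by the weak user's condition), and uses hypothetical function names.

```python
import math


def rate_base(beta, P, N_w):
    """Rate (bits/symbol) of the message decoded by both users."""
    return 0.5 * math.log2(1.0 + (1.0 - beta) * P / (beta * P + N_w))


def rate_refine(beta, P, N_s):
    """Rate (bits/symbol) of the message decoded only by the strong user."""
    return 0.5 * math.log2(1.0 + beta * P / N_s)


def one_bit_operating_point(P, N_w, N_s):
    """Return (beta, weak_gap, strong_gap) for beta = min(1, 3 N_w / P).

    The gaps are the losses, in bits per symbol, relative to each user's
    point-to-point capacity. Assumes P > 0 and 0 < N_s <= N_w.
    """
    beta = min(1.0, 3.0 * N_w / P)
    R_w = rate_base(beta, P, N_w)
    R_s = R_w + rate_refine(beta, P, N_s)
    C_w = 0.5 * math.log2(1.0 + P / N_w)  # weak user's capacity, R_w(0)
    C_s = 0.5 * math.log2(1.0 + P / N_s)  # strong user's capacity, R_s(1)
    return beta, C_w - R_w, C_s - R_s
```

For example, `one_bit_operating_point(100.0, 1.0, 0.01)` returns a `beta` of 0.03 together with the two rate losses, both of which should be at most one bit under the stated assumptions.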

Appendix II 
Proof of Theorem [5] 

The proof for the bandwidth contraction case ($\alpha = M/K < 1$) follows from the following choice of 
parameters in Theorem 4: $L = 0$, $K' = M$, and the power allocation is 

$$\begin{aligned}
P_m &= P, & m &= 1, 2, \ldots, M,\\
P'_m &= \lambda P, & m &= 1, 2, \ldots, M,\\
P''_m &= \lambda\gamma P, & m &= 1, 2, \ldots, M,
\end{aligned}$$

where $\lambda$ and $\gamma$ are in $[0, 1]$. Also, we let 

$$\begin{aligned}
D_k &= \tilde{D}, & k &= 1, 2, \ldots, M,\\
D_k &= D, & k &= M+1, M+2, \ldots, K,\\
D'_k &= D', & k &= M+1, M+2, \ldots, K,
\end{aligned}$$

where $\tilde{D}$, $D$, and $D'$ are defined as below to satisfy the conditions of Theorem 4: 

$$\frac{\sigma^2}{\tilde{D}} = 1 + \frac{\lambda(1-\gamma)P}{\lambda\gamma P + N_s},$$

$$(K - M)\log\frac{\sigma^2}{D'} = M\log\frac{P + N_w}{\lambda P + N_w},$$

$$(K - M)\log\frac{D'}{D} = M\log\frac{\lambda\gamma P + N_s}{N_s}.$$

Substituting these in the expression for the achievable $(D_s, D_w)$ gives the result. 

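The choice of parameters for the bandwidth expansion case ($\alpha = M/K > 1$) is $L = K' = K$, and the 
power allocations are 

$$P_m = \frac{1}{K}(\gamma M P) = \alpha\gamma P, \quad m = 1, 2, \ldots, K,$$

$$P_m = \frac{1}{M-K}\big((1-\gamma)MP\big) = \frac{\alpha(1-\gamma)}{\alpha-1}P, \quad m = K+1, K+2, \ldots, M,$$

where $\gamma \in [0, 1]$. Clearly, we have $\sum_{m=1}^{M} P_m = MP$ as required. Also, we set 

$$P'_m = (1 - \lambda)P_m, \quad m = K+1, K+2, \ldots, M,$$

where $\lambda \in [0, 1]$, and $D'_k = D'$, $D''_k = D''$, $D_k = D$, $k = 1, 2, \ldots, K$, where $D'$, $D''$, and $D$ are chosen 
as follows to satisfy the conditions in Theorem 4: 

$$K\log\frac{\sigma^2}{D'} = (M - K)\log\frac{\frac{\alpha(1-\gamma)}{\alpha-1}P + N_w}{(1-\lambda)\frac{\alpha(1-\gamma)}{\alpha-1}P + N_w},$$

$$\frac{D'}{D''} = 1 + \frac{\alpha\gamma P}{N_s},$$

$$K\log\frac{D''}{D} = (M - K)\log\frac{(1-\lambda)\frac{\alpha(1-\gamma)}{\alpha-1}P + N_s}{N_s}.$$

These choices give the achievability result for bandwidth expansion. 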

Appendix III 
Proof of Theorem [4] 



The main ideas involved have already been described in Section IV. We sketch the main steps of the 
proof which use the following results: successive refinement source coding [7], source coding with 
side-information or Wyner-Ziv (WZ) coding [12], superposition broadcast channel coding [2], and channel 
coding with side-information or Gel'fand-Pinsker (GP) coding [13] (in particular, as applied by Costa to 
the Gaussian case [14]). 



The $m$-th sub-channel is allocated a power of $P_m$ such that the power constraint (12) is satisfied. The 
coding will be performed as usual on block-length $n$ sequences of sufficient length that the source codes 
invoked below have distortions close to optimal and the channel codes have low probabilities of error. 
For clarity, we will suppress these small gaps as we did in the discussion of the $K = M = 2$ example in 
section I 
a) Source components 1 through $L$: These source components are treated in a way similar to how the 
first source component was in the $K = M = 2$ example (Fig. 6) of Section IV, in that they are sent in three 
different ways. An $n$-length block of the $k$-th such source component is source-coded (quantized) using 
an optimal source code at distortion $D'_k$. This codeword will be made available to both the strong and the 
weak receivers. The rate required to do this is 

$$\sum_{k=1}^{L} \frac{1}{2}\log\frac{\sigma_k^2}{D'_k}.$$

Let the decoder reconstruction of the $i$-th sample be $S'_k(i)$. 

The quantization error of the $k$-th source component is transmitted over the $k$-th sub-channel using 
power $P_k$. In other words, the input to the $k$-th sub-channel is 

$$X_k(i) = \sqrt{\frac{P_k}{D'_k}}\,\big(S_k(i) - S'_k(i)\big), \quad i = 1, 2, \ldots, n.$$

To produce its reconstruction, the weak user adds to $S'_k(i)$ the linear least-squared error estimate of 
$S_k(i) - S'_k(i)$ from $Y_{w,k}(i)$. This estimate has an MSE of $D'_k / (1 + P_k/N_w)$. This gives the first term in the expression 
for $D_w$ in the theorem. The strong user also performs the same to get an intermediate reconstruction 
of the source component at an MSE distortion of $D''_k = D'_k / (1 + P_k/N_s)$. This will act as side-information 
available at the decoder for a Wyner-Ziv source coding of $S_k$ (again footnote 2 applies). We would like to 
enhance this to a distortion of $D_k$ in the expression for $D_s$ in the theorem. Using an extension of Wyner 
and Ziv's result, the Wyner-Ziv bitrate needed to be delivered to the strong user is 

$$\sum_{k=1}^{L} \frac{1}{2}\log\frac{D''_k}{D_k}.$$
b) Source components $L+1$ through $K'$: These source components are similar to the second source 
component of the $K = M = 2$ example (Fig. 6) of Section IV in that they are sent uncoded, but the 
sub-channels over which they are sent may have other bits sent using codewords. The $k$-th source component 
is sent uncoded on the $k$-th sub-channel using power $P'_k - P''_k$. The rest of the power spent on this 
sub-channel is utilized to send the coded parts as will be discussed below. However, we need to note the fact 
that both decoders will be able to decode a coded part sent using power $P_k - P'_k$ on this sub-channel 
and subtract it off before estimating $S_k$. The rest of the coded part, which is sent at power $P''_k$, will act as 
interference. Hence the MSE distortion for the $k$-th source component will be 

$$D_{j,k} = \frac{\sigma_k^2}{1 + \frac{P'_k - P''_k}{P''_k + N_j}}, \quad j \in \{s, w\}.$$

This gives the corresponding terms in the expressions for $D_s$ and $D_w$ in the theorem. 
c) Source components $K'+1$ through $K$: These source components will have no uncoded component 
unlike the above two cases. Thus they are source coded using an optimal successive refinement code [7] 
which works at a coarse description distortion of $D'_k$ and a fine description distortion of $D_k$. These give the 
corresponding terms in the expressions for $D_w$ and $D_s$ respectively in the theorem. The bitrate for the 
coarse description which will be made available to both the users is 

$$\sum_{k=K'+1}^{K} \frac{1}{2}\log\frac{\sigma_k^2}{D'_k},$$

and the refinement layer which will be made available only to the strong user has a bitrate of 

$$\sum_{k=K'+1}^{K} \frac{1}{2}\log\frac{D'_k}{D_k}.$$

Thus overall we need to send bits at a rate of 

$$\sum_{k=1}^{L} \frac{1}{2}\log\frac{\sigma_k^2}{D'_k} + \sum_{k=K'+1}^{K} \frac{1}{2}\log\frac{\sigma_k^2}{D'_k} \tag{16}$$

to both the users, and in addition bits at the rate of 

$$\sum_{k=1}^{L} \frac{1}{2}\log\frac{D''_k}{D_k} + \sum_{k=K'+1}^{K} \frac{1}{2}\log\frac{D'_k}{D_k} \tag{17}$$

to the strong user. In the next two steps we show how this is accomplished. 

d) Sub-channels $L+1$ through $K'$: As discussed above, on the $m$-th sub-channel a power of $P'_m - P''_m$ 
is used for uncoded transmission. The rest of the power is allocated as follows: $P_m - P'_m$ is used for 
sending bits to both the receivers. The rest of the power, $P''_m$, is used to send bits which will be decoded 
only by the strong receiver. When decoding the common bits, both receivers treat the rest of the power as 
interference. Hence the bitrate of the common part is limited by the weaker user, resulting in a rate of 

$$\sum_{m=L+1}^{K'} \frac{1}{2}\log\left(1 + \frac{P_m - P'_m}{P'_m + N_w}\right).$$

As mentioned earlier, upon decoding, both users will subtract the codeword corresponding to the decoded 
common bits from their received signal. To send additional bits to the strong user, we use the concept of 
dirty-paper coding [14], [13]. The uncoded transmission can be thought of as Gaussian side-information 
(or "dirt") which is known at the encoder. From Costa [14, Sec. II], we know that a power allocation of 
$P''_m$ can support a rate of 

$$\sum_{m=L+1}^{K'} \frac{1}{2}\log\left(1 + \frac{P''_m}{N_s}\right)$$

to the strong user. 

e) Sub-channels $K'+1$ through $M$: In these sub-channels no uncoded transmission is performed. We 
use the idea of superposition coding [2] to deliver a common bitstream to both the users and in addition 
a refinement bitstream to only the stronger user. With a power allocation of $P_m - P'_m$ to the common 
bitstream and the rest, $P'_m$, to the refinement bitstream, we get the following bitrates for the common and 
refinement bitstreams respectively: 

$$\sum_{m=K'+1}^{M} \frac{1}{2}\log\left(1 + \frac{P_m - P'_m}{P'_m + N_w}\right),$$

$$\sum_{m=K'+1}^{M} \frac{1}{2}\log\left(1 + \frac{P'_m}{N_s}\right).$$
Thus the total rate available for sending a common bitstream to both the users is 

$$\sum_{m=L+1}^{M} \frac{1}{2}\log\left(1 + \frac{P_m - P'_m}{P'_m + N_w}\right).$$

Condition (15) ensures that this is sufficient to handle the rate of the common bitstream in (16). Similarly, 
the total rate available for sending an enhancement bitstream to the stronger user is 

$$\sum_{m=L+1}^{K'} \frac{1}{2}\log\left(1 + \frac{P''_m}{N_s}\right) + \sum_{m=K'+1}^{M} \frac{1}{2}\log\left(1 + \frac{P'_m}{N_s}\right),$$

which is larger than the rate of the enhancement bitstream in (17) by condition (15). This completes the 
proof. 
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